Skip to main content
Close announcements banner

Change data partitioning on Microsoft Azure HDInsight

When Change Data Partitioning is enabled, the Replicate Change Tables in Hive are partitioned by the partition_name column. Data files are uploaded to the Blob/ADLS storage, according to the maximum size and time definition, and then stored in a directory under the Change Table directory. Whenever the specified partition timeframe ends, a partition is created in Hive, pointing to the Blob/ADLS storage.

Information about the partitions is written to the attrep_cdc_partitions Control Table.

 

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!